A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection
نویسندگان
چکیده
Humans tend to mine objects by learning from a group of images or several frames video since we live in dynamic world. In the computer vision area, many researchers focus on co-segmentation (CoS), co-saliency detection (CoSD) and salient object (VSOD) discover co-occurrent objects. However, previous approaches design different networks for these similar tasks separately, they are difficult apply each other. Besides, fail take full advantage cues among inter- intra-feature within images. this paper, introduce unified framework tackle issues view, term as UFGS (Unified Framework Group-based Segmentation). Specifically, first transformer block, which views image feature patch token then captures their long-range dependencies through self-attention mechanism. This can help network excavate patch-structured similarities relevant Furthermore, propose an intra-MLP module produce self-mask enhance avoid partial activation. Extensive experiments four CoS benchmarks (PASCAL, iCoseg, Internet MSRC), three CoSD (Cosal2015, CoSOD3k, CocA) five VSOD (DAVIS $_{16}$ , FBMS, ViSal, SegV2 DAVSOD) show that our method outperforms other state-of-the-arts both accuracy speed using same architecture, reach 140 FPS real-time. Code is available at https://github.com/suyukun666/UFO
منابع مشابه
Saliency Detection by Selective Strategy for Salient Object Segmentation
Saliency detection is useful for many computer vision tasks including content-based image retrieval, segmentation, and object detection. However, methods on saliency detection are usually greatly affected by factors like features and segmentation results. We propose a novel selective segmentation-based saliency detection model to decrease the side effects caused by these factors. After extracti...
متن کاملSalient Object Detection and Segmentation
Automatic estimation of salient object regions across images, without any prior assumption or knowledge of the contents of the corresponding scenes, enhances many computer vision and computer graphics applications. We introduce a regional contrast based salient object extraction algorithm, which simultaneously evaluates global contrast differences and spatial weighted coherence scores. The prop...
متن کاملEfficient Co-Salient Video Object Detection Based on Preattentive Processing
Automatic video annotation is a critical step for contentbased video retrieval and browsing. Detecting the focus of interest such as co-occurring objects in video frames automatically can benefit the tedious manual labeling process. However, detecting the co-occurring objects that is visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...
متن کاملThree Birds One Stone: A Unified Framework for Salient Object Segmentation, Edge Detection and Skeleton Extraction
In this paper, we aim at solving pixel-wise binary problems, including salient object segmentation, skeleton extraction, and edge detection, by introducing a unified architecture. Previous works have proposed tailored methods for solving each of the three tasks independently. Here, we show that these tasks share some similarities that can be exploited for developing a unified framework. In part...
متن کاملTemporally Object-based Video Co-Segmentation
In this paper, we propose an unsupervised video object cosegmentation framework based on the primary object proposals to extract the common foreground object(s) from a given video set. In addition to the objectness attributes and motion coherence our framework exploits the temporal consistency of the object-like regions between adjacent frames to enrich the set of original object proposals. We ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Multimedia
سال: 2023
ISSN: ['1520-9210', '1941-0077']
DOI: https://doi.org/10.1109/tmm.2023.3264883